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-  Mr.  Jbrry  Klion,\3ri)8Eiss  Air  Force  Base,  p/><ne,  ]STew  Ybrk,  posed  the  following  prob¬ 
lem?* The  Air  Force  is  Considering  the  contract  frenewal'Application  of  a  civilian  contractor 
hired  to  maintain  in  workingvorder  a  series  of  /adar  stations.  The  measure  of  performance 
of  interest  is,  T,  the  time  that  a  particular  station  is  not  ‘onHipe’  while  being  down  for 
repair.  The  Contract  stipulatesXthat  repair  4erVice  will  be  such  tii&Kon  the  average  50% 
of  all  repairs  will  be  completed  b\fore  Li  hours  and  90%  of  all  repairs  sh^ll  be  completed 
before  Lt  hours.  It  also  states  thht  the  repair  contract  will  be  renewed  oh' the  basis  of 
a  decision  rule 'that  errs  by  failing  t\r/new  when  the  case  is  that  the  contract  s'htrahLbe 
renewed  with  a  probability'  of  <*.  Th^,  renewal  of  the  contract  depends  on  the  making  a 
decision  based  on  N  repair  times,  TY,!},  —  *  ,  Xjv,  of  the/contractor  as  to  whether  or  not  Lj 
is  at  least  the  50**  percentile  and  L%  is  ftt  least  the  90**  percentile  of  -F(-),  the  distribution 
function  F(.)  of  these  repair  times.  /The  usual  test  based  on  the  binomial  distributions^ 
of  the  number  of  repairs  before  Lj/and  the  number  of  repairs  before  L$  suffers  from  two 
problems!  Jl)  Its  true  size  is  at  times  far  from  the  nominal  sizejand^)  because  of  the 
discrete  of  the  random  variables,  cannot  be  performed  at  the  stipulated  size.  This  paper 
proposes  the  use  of  the  likelihood  ratio  test  based  on  the  nultinomial  joint  distribution  of 

the  number  of  repairs  before  Ly  and  the  number  of  repairs  before  L 2.  f  ,  X"— . - 

More  generally,  a  likelihood  ratio  test  to  simultaneously  test  K  hypotheses  on  cumu¬ 
lative  sums  of  multinomial  parameters  is  described  here.  An  algorithm  is  supplied  for 
the  evaluation  of  this  statistic.  It  is  shown  that  this  test  procedure  allows  for  a  test  at 
approximately  the  correct  size  that  has  uniformly  better  power  with  respect  to  test  that 
is  the  natural  competitor.  "  -  •  4.''  <- - — 


Key  Words:  hypothesis  testing,  multinomial,  likelihood  ratio,  quantiles 


The  Problem  and  its  Notation 


The  following  problem  was  posed:  one  of  the  armed  forces  is  considering  the  contract 
renewal  application  of  a  civilian  contractor  hired  to  maintain  in  working  order  a  series  of 
radar  stations.  The  measure  of  performance  of  interest  is  T,  the  time  that  a  particular 
station  is  not  ‘on  line’  while  being  down  for  repair.  The  contract  stipulates  that  repair 
service  will  be  such  that  50%  of  all  repairs  will  be  completed  before  L\  hours  and  90% 

1  Research  partially  sponsored  by  the  U.S.  Air  Force  Office  of  Scientific  Research  under 
AFOSR  Grant  88-0400. 

2 Currently  at  Merck  Sharpe  &  Dohme  Research  Labs. 
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of  all  repairs  shall  be  completed  before  L 2  hours.  It  also  states  that  the  repair  contract 
will  be  renewed  on  the  basis  of  a  decision  rule  that  qrrs  by  failing  to  renew  when  the 
case  is  that  the  contract  should  be  renewed  with  a  probability  of  a.  The  renewal  of  the 
contract  depends  on  the  making  a  decision  based  on  N  repair  times,  T\ ,  Ti ,  •  •  • ,  Ts ,  of  the 
contractor  as  to  whether  or  not  L\  is  at  least  the  50th  percentile  and  L 2  is  at  least  the 
90t/l  percentile  of  P(-),  the  distribution  function  of  these  repair  times.  In  this  situation, 
N  the  number  of  repair  times  may  be  small,  usually  less  than  20. 

In  more  general  terms,  a  contract  might  stipulate  performance  criteria  on  K  quantiles 
(K  >  2)  of  a  distribution  function  P(-)  specified  by  a  set  of  time  periods,  { L\,Li ,  •  •  • ,  Lk  }, 

L\  <  L-j,  <  •  •  •  <  Lk ,  and  a  set  of  probabilities,  {po  i,P0  2>-  •  -.Potf},  0  <  !Cl=i  Po*  <  1  for 
1  <  *  <  K.  To  simplify  the  notation,  let  p,-  =  F(L,-)  —  F(L,_i)  and 

3  3 

Pf  =  p"  and  Po\  ~  E  POv  for  *  -*■ 

When  the  subscript  :  is  0,  it  will  be  suppressed: 

Pj  =  P3o  and  PQ3'  =  P0}0. 

On  the  basis  of  N  observations,  the  interest  is  in  simultaneously  testing  the  following 
K  hypotheses  with  a  specified  type  I  error  rate  a: 

Hq  :  P‘  >P0‘,  »  =  I,2,...,K.  (1) 


The  general  problem  is  to  simultaneously  test  K  one-sided  hypotheses  on  cumulative  sums 
of  parameters  of  a  multinomial  distribution.  Although  the  discussion  here  focuses  on  K 
hypotheses  stipulating  a  greater  than  or  equal  (>)  relationship,  the  techniques  illustrated 
here  can  also  be  applied  to  hypotheses  in  the  opposite  direction  (<). 


The  Standard  Approach  and  its  Limitations 

For  u  =  1,*  let  Nv  =  the  number  of  repair  times  that  fall  between  Lv  and  j 
with  Lo  =  0-  More  generally  Nu  is  the  number  of  observations  falling  in  the  vlh  cell  of  a 
multinomial  distribution  with  parameter  pi,..  The  quantity  Nv  is  a  binomial  random 

variable  with  parameter  P‘,  1  =  !,-••, K.  A  reasonable  approach  for  constructing  a  test  of 
the  hypothesis  in  (l)  is  employ  the  Union-Intersection  technique  :  perform  K  tests  on 
the  P‘,  i  =  1,2 The  null  hypothesis  is  rejected  i  lor  any  »,  1  <  t  <  K,  ^  < 

n°.  The  critical  values  are  either  determined  from  the  '  jmial  distribution  or  from  the 
normal  approximation  to  the  binomial  such  that  for  i  =  1,  •  •  • ,  Kt 


< {P <  B°)  }  =  ‘  (E  n-  <  n°)  = 

Here  a,-  is  the  error  rate  or  size  for  testing  the  individual  hypothesis  HJ,.  The 
is  be  selected  in  such  a  way  to  make  the  overall  experiment-wise  error  rate  conform  to 
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the  contract-  specified  error  level  a  .  This  choice  of  the  n°  can  utilize  the  Bonferroni 
Inequality: 


'-,=1  *^=1  >  J  1—1  ^=1  >  1=1 


(2) 


When  the  null  hypothesis  is  true  and  ^,=1  a,-  =  a,  the  specified  experiment-wise  error 
rate  is  bounded  above  by  a. 

In  situations  in  which  we  are  testing  a  hypothesis  on  K  >  3  quantiles,  for  small 
samples  the  union-intersection  procedure  gives  the  decision  maker  little  guidance  in  either 
constructing  a  critical  region  or  in  estimating  the  error  level.  For  example,  below  is  a 
portion  of  the  binomial  tables  for  N  —  8. 


Table  1:  Abbreviated  Binomial  Table  for  N  =  8. 


n  \  p 

.20 

.40 

.60 

.80 

0 

.167 

.017 

.001 

.000 

1 

.503 

.106 

.009 

.000 

2 

.797 

.315 

.050 

.001 

3 

.944 

.594 

.174 

.010 

4 

.990 

.826 

.406 

.056 

5 

.999 

.950 

.685 

.203 

6 

1.000 

.992 

.894 

.497 

7 

1.000 

.999 

.983 

.832 

8 

1.000 

1.000 

1.000 

1.000 

In  testing  the  hypothesis 

Po  >  -20,  P02  >  .40,  P03  >  .60,  and  P04  >  .80  at  a  =  .05, 

the  cautious  decision-maker,  using  the  Union-Intersection  technique,  would  feel  obliged 
to  reject  if  -Wi/  =  0  for  *  =  1,2,3,  or  4,  knowing  only  that  the  type  I  error  rate  is 

bounded  above  by  .167  +  .017  +  .001  =  .185.  A  selection  of  any  other  critical  region  would 
more  than  likely  rule  out  any  test  at  all  on  the  20**  percentile  and  call  for  a  test  based 
only  on  (n2,n3,n4).  This  reduces  to  a  test  on  three  proportions.  But  we  wanted  to  test 
four!  One  concludes  that  for  small  samples,  where  the  probability  of  1  ‘success’  at  P1  is 
greater  than  the  required  size  of  the  test,  the  union-intersection  technique  is  inadequate. 

A  Generalized  Likelihood  Ratio  Test  (GLRT) 

From  (l)  we  note  that  the  hypothesis  involves  a  test  of  multinomial  parameters. 
The  observed  vector  [Ni,N2,"’,Nk)  is  distributed  as  a  /f-nomial  random  vector  with 
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parameter  (pi,P2i •  *  •  ,p/c)  in  the  parameter  space 


fl=  jp€P*  |0<  £p„  <  1,  pt-  >0,  1  <»'  <#]. 

u-\  J 


The  null  hypothesis  tests  if  p  €  flo  with 


n0  =  {pe  n  |  P‘  >  p0\  i<i<  k}. 

The  vector  (p0 1 , poa>  ‘  > P0 k)  characterizes  the  population  with  respect  to  the  K  disjoint 

time  intervals  Lt-  —  T,_i,  *  =  1, in  the  sense  that  P0l  is  the  lower  bound  for  the 
hypothesized  proportion  of  repair  times  occurring  before  L,-. 

An  example  for  K  =  2  is  illustrative. 

Example  i. 

The  parameter  space  is 


n  =  {(pi,p2)  I  o  <  pi  <  i,  o  <  P2  <  i,  p2  <  i}. 

Partition  Q  into 

Ho  =  { (pi i P2)  i  Pi  >  poll  -P2  >  -Po2}  and  Qq  =  H  -  n0. 
The  null  hypothesis  space,  Hq,  is  shaded  in  Figure  1. 


Figure  1  Partitioning  the  Parameter  Space  when  K  =  2 
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We  wish  to  test 


H0  :  (pi » P2)  €  H0  versus  Hx  :  (p‘i,p2)  € 


A  natural  suggestion  for  testing  this  hypothesis  is  to  employ  a  generalized  likelihood 
ratio  test  (GLRT).  Many  statistical  tests  are  based  on  the  generalized  likelihood  ratio 
approach  and  there  is  a  large  body  of  statistical  theory  in  support  of  such  procedures. 
The  large  sample  theory  for  the  particular  application  here  was  first  presented  by  Chernoff 
(1954).  Extensions  to  the  Chernoff  result  can  be  found  in  Feder  (1968)  ,and  Self  and  Liang 
(1987).  A  recent  application  can  be  found  in  Greenberg  (1985).  While  Feder  and  Self 
and  Liang  address  general  properties  of  this  statistic  in  the  large  sample  case,  Greenberg 
discusses  a  specific  application  similar  to  the  problem  addressed  here.  Greenberg’s  interest 
is  to  simultaneously  test  one-  sided  hypotheses  on  K  —  1  multinomial  parameters  from  a 
if-nomial  distribution.  In  terms  of  (l)  Greenberg  wishes  to  test 

Hi:  P<>P0i,i=2,--,K.  (1') 

Although  the  difference  between  (1)  and  (l’)  is  only  the  lack  of  interest  in  what  happens  in 
the  first  cell,  the  solution  to  the  maximization  problem  is  made  more  tractable  as  Greenberg 
shows.  Here  we  consider  the  larger  hypothesis  (l)  and  construct  the  test  procedure  when 
applied  to  small  sample  case  ( N  <  20). 

As  before,  N  denotes  the  total  number  observations  and  let 


N  =  {NuN2y--NK}. 


The  generalized  likelihood  ratio  used  here  is  defined  to  be 

max  L(N,p)  K  ,  \ 

AW  =  Wn°  ,,  ,■  wW  L(N,pl  =  II  &  1  ~  P*  ) 
max  L[N ,p)  x  \  J 


Denote 

^log(A(iV))  =  logL^,p)  =  Y,  log  M  +  log  C l~pK )• 

J  V=l  \  U=1  J 

The  decision  rule  is 

reject  Ho  if  A( N )  <  6  where  max  P{A(Ar)  <  (5}  =  a. 

pen0 


Evaluating  A  (N) 


The  use  if  the  Likelihood  Ratio  statistic  requires  the  determination  of  6  based  on  the 
distribution  of  A (N).  The  evaluation  of  the  statistic  involves  maximizing  logL(p,  •)  over 
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Do  and  its  complement.  The  geometric  structure  of  the  null  hypothesis  space  and  algebraic 
properties  of  the  likelihood  function  in  this  case  make  it  possible  to  establish  an  algorithm 
that  facilitates  the  evaluation  process.  The  formal  proofs  are  detailed  and  reserved  for 
the  Appendix.  However,  intuitive  arguments  and  examples  are  given  to  help  the  reader 
understand  better  the  rationale  for  the  algorithm  and  the  manner  in  which  it  functions. 

Assume  in  what  follows  that  fl  C  RK .  Then  following  definitions  and  results  hold. 

(a)  If  p  (the  global  MLE)  G  n^Ho)  then  L(fi,p)  is  maximized  on  flo(no)  on  the  boundary 
between  flo  and  Hq  =  B^0.  It  is  well  known  that  the  multinomial  pdf  converges  to  the 
multivariate  normal  pdf  implying  that  the  log  multinomial  likelihood  function  converges 
to  the  the  log  of  normal  likelihood  function  which  in  turn  is  inversely  proportional  to  a 
(Mahalanobis)  distance.  Thus,  given  an  observation  p,  log L(p,  p)  behaves  like  aa  inverted 
metric:  if  p  G  flo(flo)  and  p*  G  n0(no)>  then  log L(p,  p)  decreases  as  a  function  of  p  along 
the  line  segment  £.  between  p  and  p”.  Because  L  must  cross  B^0  before  it  enters  fl0(Do)> 

A 

it  follows  that  logL(n,  p)  is  maximized  for  p  G  no(Ho)  on  Bp0. 

(b)  Because  of  (a),  the  evaluation  of  A(jV)  depends  on  points  in  Bp0.  Here  we  define  the 
constituent  parts  of  Bp-0  : 

Define  a  "one-restriction”  plane  as 

PlMK&’a  {p€JJK|P,'  =  fV}- 

In  turn  a  ”t-restriction”  plane  is  defined  to  be  for  2  <  t  <  K  and  any  nonempty  subset 
{»i,  •  •  • ,  t't}  of  the  set  {1,2,  •  •  • ,  K}  as 

plane£ =  f|  Plane^, 

Define 

EdgejP’  =  PlanejP’  ',l,)  n  H0  = 

{p  €  RK  n  n  I  P*  =  P0‘  for  *  =  >  Pq'  for  t  ^  *i , *2, *  •  ■>»<}• 

The  boundary,  Bp0,  is  the  nondisjoint  union  of  the  2K  —  1  =  (*)  edges. 

For  example,  if  K  =  3,  there  are  23  —  1  =  7  such  edges  whose  union  determines  Bp0. 
If  PQ  1  =  ,2,P02  =  .4,  and  P03  =  .6,  then  one  of  the  (3)  ‘2-restriction’  planes  is  based  on 
the  subset  {1,2,}  of  the  set  {1,2,3}  denoted  above  as 

Planel?'2)  =  Q  Plane^  =  {p  G  F3  n  fl  |  P1  =  .2, P2  =  .4} 

*'€{1,2} 


and 

Edge<-?'2)  =  {p  G  F3  n  fl  |  P1  =  .2,  P2  =  .4,P3  >  .6}. 
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P  3 


Figure  2(a)  For  K  —  3,  0  with  Ho,  the  boundary  B^,  is  shaded. 
p3 


Figure  2  (a)  illustrates  fl  for  this  example.  There  the  boundary  Bp0  is  shaded.  The 
parameter  space  0  is  the  set  of  points  inside  the  tetrahedron  fl.  The  null  hypothesis  space 
Do  is  all  of  the  points  on  B^0  and  ‘behind’  it.  Figures  2  (b)  ,  (c),  and  (d)  show  extended 
rectangular  portions  of  Plane^  for  i  =  1,2,  an  3  respectively,  indicating  the  contribution 
of  each  plane  to  B^-0 . 
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For  p,define  p(‘>*  as  the  vector  that  maximizes  log L(p,p)  on  Plane^1’  ,l'\  Con¬ 
sistent  with  the  earlier  shorthand  notation  for  the  cumulative  sums,  we  denote 

3 

p{i i, •••,*'.)/=  ^  and  pC*1  »•“•*«)/  =  p(»* . 

i/=»+i 


(c)  For  an  observed  p  the  coordinates  of  p(‘>>"  *‘»)  are  given  by  the  following  theorem. 
Theorem  l. 

If  we  denote  0  as  i‘o,  then  the  ith  coordinate  of  p  *’  ’  *  is 


Pi  P 

(aj — r.0*'— -)  if  ty_i  <  t  <  »y,  1  <  j  <  t,  and  ±  0, 

^  j-i 


v-> 


-(*1  ,  -,*«)  _  )  (b)  Poi,  if  ty —  i  <  t  <  *y,  1  <  3  <  and  P,J  =  0, 
v  i  )  >  ~ 1 

(c)  ^  -f°— ,  if  it  <i<K  and  P*'«  ^  1, 

1  7  1  -  P“ 

[  fdj  0,  if  »<  <  <  if  and  P{*  =  1. 


We  illustrate  the  use  of  Theorem  1  and  its  notation  with  a  few  examples. 

Example  2. 

Suppose  that  K  —  4.  Assume  that  p,-  p  0,  1  <  i  <  K. 

*  .  -Ji) 

(l).  We  describe,  for  a  particular  p,  the  coordinates  of  p  ,  the  point  that  maximizes 

a 

logL(p,  p)  on  the  one-restriction  plane 

Plane^}  =  {p  €  H  |  pi  =  p0i}. 

Here  t,  the  number  of  restrictions  is  1,  it  =  t j  =  1,  and  0  <  j  <  it  =  1.  Thus  j  only  takes 
the  value  1.  The  subscript  i  takes  the  values  1  to  4. 

For  i  =  1,  we  use  formula  part  (a) 


M1’  = 


__  PiW)  _  PiPoi  _ 


P1 


Pi 


—  Po  1- 


Note  that  if  p\  =0  we  would  use  formula  part  (b)  and  simply  assign  p[^  =  po  i  since  p[ 
must  be  an  element  of  Plane L1^. 

Po 


-(i) 
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For  »‘i  <  i  <  4  or  for  2  <  t  <  4,  we  use  formula  part  (c)  to  compute 


;(■)  =  P-(l--Po’) 
1  -  P1 


Thus 


-(i)  _  P 2(1  -  Poi) 

■(.)  =  P3(l-ZP0,)|  ^ 
1  “  Pi 

.(1)  _  Pa(1  ~  Po  1 ) 

^4  -i  * 

1  -  pi 


Note  that  if  any  of  the  p,  =  0  for  i  =  2,  3,  or  4,  then  naturally  the  value  of  p^  would  be  0. 

If  P1  =  Pi  =  1,  then  the  likelihood  function  depends  only  on  the  value  of  p\.  From  formula 
part  (d)  we  assign  p\  the  value  0  for  2  <  *  <  4. 

(2).  We  compute  the  coordinates  of  p  ,  the  point  that  maximizes  logL(p,p)  on  the  one- 
restriction  plane 

Planed  =  {pgn|P3  =  P03}. 


po  I  v  ) 

In  this  case  also  t  —  1  and  it  —  3.  Again  j  takes  the  value  1.  For  1  <  i  <  i}  =  i\  =  3. 
P3  7^  0,  we  use  formula  part  (a): 


If 


-(3)  _  P.-^o3)  _  P.-(po  1  +  Po  2  -r  Po  3) 

P'  P3  Pi  +  P2  +  P3 

The  likelihood  function  restricted  this  set  is 
2 

£p*logM  +P3log(P03  ~  Pi  -  Pi)  +  (l  -  P 4)  log(l  -  P03  ~  p4). 

i/=l 


Note  that  if  P3  =  0,  the  likelihood  function  does  not  depend  on  pi,  P2,  or  p 3.  However  we 
require  that  p(3)3  =  P03.  Since  the  values  pi,  P2,  and  pz  have  no  effect  on  the  value  of 
the  likelihood  function,  we  assign  to  them  the  values  poi,  Po2»  and  P03  respectively.  This 
is  an  example  of  the  application  of  formula  pant  (b). 

For  i  =  4,  we  use  formula  part  (c): 


6m  =  Mi-V) 

P<  1  -  P3 


P<(1  ~  PO  1  ~  P02  —  P03) 
1  -  pi  -  P2  ~  P3 


(3).  Now  we  compute  p  ,  the  point  that  maximizes  logL(p, p)  on  the  two-restriction 


plane 

Plan4o’3)  =  {p  e  n  | Pi  =  Poi  and  P3  =  P03}. 
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Here  t  —  2,  »i  =  1,  and  »2  =  3.  For  »  =  1,  using  formula  part  (a) 


*(i.3)  Pif^o1)  PiPoi 

Pi  = — aT—  =  -7—  =  Poi- 


For  » '  =  2  and  3, 


-(1,3)  _  P2P0  2  _  P2 (pp 2  +  P03) 
P|  P2  +  P3 

-(1,3)  _  P3-P 0  2  _  P3  (PO  2  +  P03) 
P3  "  63  “  pj+PS 


For  *  =  4,  we  use  formula  part  (c): 


-(1,3)  _  P*(l  ~  -Pp 3)  _  P<(1  —  Po  1  -  P02  ~  P03) 

4  1  -  P3  1  -  Pi  -  P2  -  P3 

Note  that  Pj1,3^  =  p^  and  p^1’3^  =  p^. 

(4).  Now  we  will  compute  p  ’  ,  the  point  that  maximizes  logL (p,p)  on 


Plane^l4)  =  {p  €  H  |  P2  =  P0  2  and  P<  =  P04}. 


Again  t  =  2,  :'j  =  2,  and  t2  =  4.  For  t  =  1  and  2, 


For  *  =  3  and  4, 


-(2,4)  _  Pi(Pp  2)  _  P«(Poi  4-  P02) 
*’  P2  P1  +  P2 


.(2,4)  _  Pi{Ppt)  _  PijPp  3  +  PO*) 

‘  “  A*  "  P3+P, 


So  far  we  have  described  the  candidate  set,  Bp-0,  where  log L(p,p)  should  be  max¬ 
imized,  identified  sets  denoted  by  Plane£o*’,a’  subsets  of  which  are  the  constituent 

parts  of  Bp-0,  and  determined  the  components  of  p'  1  ’  '  ^  that  maximize  log L(p,p)  on 
each  Planeg,’,3'‘",*,^.  Thus  for  am  observed  p  €  fig,  there  are  2*  —  1  candidate  ’  '^’s 
for  po,  the  p  that  maximizes  log L(p,p)  on  H0  when  p  £  fig.  Now  we  give  the  criteria  to 
chose  p0  from  among  these  2K  —  1  candidates. 

(d)  If  p  €  D 0  then  p0  =  p  *’  ’  where  {*1, •••,»«}  is  the  subset  of  {1,2,-  ••,-?£}  with  the 

smallest  number  of  elements  such  that  p%x'  '  ^  €  flo- 

Intuitively,  if  A  and  B  are  sets  with  A  C  B  C  {1,2 then  log L(p,p)(B^  < 
log L(p,p^A^).  Thus  parts  of  B^0,  characterized  by  more  restrictions  i’i ,  *2,  *  -  -  *<  should 
yield  smaller  values  of  log L(p,p)  than  parts  characterized  by  fewer  restrictions.  However, 
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since  set  inclusion  is  only  a  partial  ordering,  some  care  must  be  taken  to  rule  out  candidate 
sets  from  different  sets  of  orderings  with  the  same  number  of  restrictions.  This  fact  makes 
an  nontrivial  task  of  proving  that  (d)  above  is  true. 

The  result  in  (d)  can  be  used  to  construct  the  following  algorithm  for  evaluating  p0. 


Evaluation  algorithm. 

If  p  is  in  CIq,  then  logL(p,p)  is  maximized  over  Hq  at  p. 


The  maximum  value  on  Oo  can  be  found  in  the  following  fashion; 

ti») 

(1)  First  compute  the  MLE’s  for  the  “one-restriction"  planes  p  for  1  <  *  <  K.  If  for  any 
*(t)  « 

i,  p  6  n0>  then  logL(p,p},  is  maximized  at  that  point.  Stop. 
tiO 


(2)  If  none  of  the  p  €  flo,  then  begin  computing  the  MLE’s  for  the  the  “two-restriction” 

planes,  p~  *’  ^  for  t‘i  =  1,  •  •  •  K  —  1,  and  t‘2  =  tj  + 1,  •  •  •  ,K.  If  for  any  pair  (t  1 ,  i2),  p~  ’  ’  ^  € 
flo,  then  logL(p,p),  is  maximized  at  that  point.  Stop. 


(3)  If  the  restricted  MLE’s  for  the  one-restriction  and  two-  restriction  planes  are  not  in 
flo,  continue  the  procedure  established  in  (l)  and  (2)  for  the  triples  that  denote  the  MLE’s 
for  the  “three-  restriction”  planes.  If  none  of  those  are  in  fl0  go  on  the  MLE’s  on  the 
‘four-restriction’  planes  etc... 

Example  3. 

Suppose  that  K  =  4.  We  wish  to  test  a  null  hypothesis  about  4  proportions: 

Ho  :  pi  >  .2,  pi  +  P2  >  -4,  Pi  +  P2  +  P3  >  -6,  and  pi  +  P2  +  P3  +  Pa  >  -8. 

Here  pio  =  P20  =  P 30  —  P<o  =  -2.  Let  N  =  10,  ni  =  1,  ri2  =  1,  n3  =  7,  and  n4  =  1,  so 
that  Pi  =  P2  =  P4  =  -1  and  p3  =  .7.  Since  pi  <  .2,  p  ^  Ho-  Thus 

sup  logL(p,p)  =  IogL(p,£). 


The  Evaluation  Algorithm  can  be  used  to  compute  sup^erJo  logL(p,  p).  We  begin  by  corn¬ 

el) 

puting  p  .  Using  the  results  of  Theorem  1  and  following  step  one  of  the  algorithm,  t  =  1 
and  i't  =  1; 


tf5 

A" 


.2, 

•1(1  --2) 
1  -  .1 
•7(1  ~  -2) 
1  -  .1 
•1(1  ~-2) 
1  -  .1 


.0888, 
.6222,  and 
.0888. 
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-Jl)  *J[2) 

Because  .2  +  .0888  <  .4,  p  $  flo-  We  go  on  to  compute  p  .  In  this  case  t  =  1,  it  =  2, 

(„  =  -1Q2  +  -2)  =  2 

^  .l+.l 

(3)  =  42+_.ij  =  2 
.1  +  .1 

42)  =  '7[1_~i2_-12-  =  -525»  “d 


aj[2) 

Note  that  p  €  Hq-  The  algorithm  says  that  we  can  stop  the  computation: 


*  A  a|2) 

sup  logL(p,p)  =  logL(p,p  ). 

p€fi  o 


No  evaluation  of  logL(p,p)  was  necessary  until  we  found  the  edge  on  which  the  maximum 
occurred  and  there  was  no  computation  involving  the  other  (t)  +  (2)  +  (3)  +  1  ~  2  =  13 
edges. 

This  algorithm  speeds  up  the  computation  of  \og(\(N))  considerably  over  the  more 
laborious  procedure  of  finding  all  of  the  restricted  MLE’s  that  are  in  flo  and  comparing 
the  values  of  logL(p,p)  at  each  of  the  points  to  find  the  maximum  value.  Thus  for  small 
N  the  distribution  of  A(iV)  can  be  easily  computed  and  a  p-value  assigned  by  computing 
probabilities  of  multinomial  outcomes. 

The  Evaluation  Algorithm  applies  to  the  maximization  of  the  likelihood  function  when 
p  6  Hq-  It  can  not  be  applied  when  p  €  Ho-  Although  when  the  likelihood  function  restricted 
to  fig  is  maximized  on  Bp-0,  the  rigorous  argument  used  to  establish  the  key  property  of 
the  algorithm  does  not  follow.  Essentially  this  is  because  fl^  is  a  union  rather  than  an 
intersection  of  sets  whose  elements  are  described  by  inequalities  involving  the  P’  and  P0‘. 

When  p  €  Ho,  instead  of  developing  an  algorithm  for  maximizing  the  likelihood  func¬ 
tion  on  Bq,  we  simplify  the  maximization  problem  by  requiring  that  our  test  be  a  member 
of  the  class  of  all  tests  of  size  a  that  contain  at  least  one  element  of  Hq  in  their  acceptance 
region.  Since  A (N)  >  1  for  observed  p  €  flo  A(A^)  <  1  for  observed  p  6  flo,  this 
restriction  implies  that  the  acceptance  region  of  the  GLRT  must  contain  flo-  This  in  turn 
implies  that 

maxFp-{p-€tl5}  >tt. 

Because  of  Proposition  E.2.c  in  Marshall  and  01kin(l979),  noting  that  every  vector  p  £  flo 
majorizes  pb,  we  can  simplify  this  requirement  to 

JVb{?€ns}>«.  (1) 

This  restriction  is  helpful  in  evaluating  the  likelihood  ratio  and  does  not  limit  sig¬ 
nificantly  the  situations  to  which  the  test  can  be  applied.  However,  for  an  easy  check  to 
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see  if  a  specific  test  situation  meets  the  requirement  (1)  above,  use  the  fact  that  for  any 

1  <i<K, 

PP<1.  {P' <  p<,'}  <  pv-A%}  ■ 

Thus  if  for  any  »,  Ppo<  jp*  <  P0’|  >  a,  the  testing  situation  meets  the  restriction. 

This  restriction  makes  sense.  If  we  designed  a  test  for  this  set  of  hypotheses  whose 
rejection  region  intersected  with  Ho,  reasonable  persons  would  have  cause  to  criticize  it. 
Viewed  from  a  Bayesian  perspective,  an  observation  in  Ho  should  strengthen  our  belief 
that  the  null  hypothesis  is  true.  Therefore  observations  "close  to”  fl0  should  be  strong 
evidence  in  support  of  the  null  hypothesis. 

Thus  requiring  that  the  acceptance  region  contain  Ho  means  that  if  p  £  CIq,  evaluation 
of  A(Ar)  is  unnecessary;  accept  Ho.  This  creates  a  slight  problem  if  one  wishes  to  compute 

A 

a  P-value  for  an  observed  p  €  Ho  :  Since  we  do  not  evaluate  A(JV) ,  we  cannot  compute  its 
P-value.  However  this  P-value  is  bounded  below  by  the  quantity 

which  can  be  computed  without  the  evaluation  of  A  (N). 

The  GLRT  versus  Union-Intersection  Tests 

In  this  testing  situation,  a  union-intersection  test  has  the  decided  advantage  of  ease  of 
computation.  However  there  are  at  times  many  candidate  union-  intersection  tests,  making 
the  selection  of  the  “  best  ”  one  difficult.  In  fact  it  can  be  shown  that  the  rejection  region, 
of  the  GLRT  is  approximated  by  such  tests  in  the  sense  that  there  exist  at  least  two 
such  union-intersection  tests,  whose  rejection  regions  Ri  and  £2  satisfy  D  D  £2. 
This  approximation  can  be  made  “  optimally”;  it  can  be  shown  that  there  is  a  union- 
intersection  test  whose  power  is  less  than  or  equal  to  the  power  of  the  GLRT  but  greater 
than  or  equal  to  the  power  of  any  other  union-  intersection  test  whose  rejection  region 
is  contained  in  R\.  This  test  would  be  a  reasonable  substitute  for  the  GLRT  since  it 
should  have  approximately  equal  power  and  size  and  be  much  easier  to  compute  than  the 
GLRT  statistic  A  (N).  However,  this  “approximating”  union-intersection  test  may  not  be 
the  “natural”  choice  among  several  competing  union-  intersection  tests,  especially  when 
K  >  3.  Without  the  aid  of  computing  the  rejection  region  of  the  GLRT  described  here, 
there  is  no  obvious  criteria  for  selecting  it  from  among  its  competitors. 

As  illustrated  by  the  following  two  examples,  the  decision  whether  to  go  to  the  added 
trouble  and  expense  of  the  GLRT  depends  on  the  set  of  quantiles  being  tested. 

Figures  3  and  4  compare  the  GLRT  versus  a  “natural”  union-intersection  test  for  testing 
the  hypothesis 

Ho  :  pi  >  .25,  pi  +  P2  >  -75,  and  pi  +  P2  +  P3  >  .95,  with  N  =  10. 

The  GLRT  rejects  H0  if  log  (A(Ar))  >  5.9  with  a  size  a  =  .052. 
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In  order  to  construct  the  Union- Intersection  Test,  we  reproduce  a  portion  of  the  Bino¬ 
mial  Tables  for  N=10.  The  entries  are  the  cumulative  probabilities  of  the  binomial  random 
variable  with  parameter  p. 

Table  2:  Abbreviated  Binomial  Table  for  N  =  10. 


n0\p 

.25 

.75 

.95 

0 

.0563 

.0000 

.0000 

1 

.2440 

.0000 

.0000 

2 

.5256 

.0004 

.0000 

3 

.7759 

.0035 

.0000 

4 

.9219 

.0197 

.0000 

5 

.9803 

.0781 

.0001 

6 

.9965 

.2241 

.0010 

7 

.9996 

.4744 

.0115 

8 

1.0000 

.7560 

.0861 

Denote  any  union-intersection  test  that  tests  Ho  by 

(.25,  .75,  .95,  n°,  nf,  n°;  N), 
indicating  that  the  rejection  region  is 

{"1  <  "i)  U  {ni  +  n2  <  n®}  U  {ni  +  n2  +  n3  <  n°}. 

We  select  (.25,  .75,  .95,  0,  1,  6;  10),  the  union-intersection  test  that  rejects  if  nj  =  0, 
ri!  4-  n2  <  1,  or  nj  +  n2  +  TI3  <  6.  The  Bonferroni  upper  bound  on  its  size  is  .0573.  Its  true 
size  is  .05718. 

Note  some  other  competitors:  might  be  chosen  to  be  any  number  between  2  and 

7. 
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Figure  3  Rejection  Regions  of  the  GLRT  and  the  Union-Intersection  Test 

(.25,  .75,  .95,  0,  1,  6;  10) 

Note:  The  eleven  graphs  that  comprise  Figure  3  compare  the  rejection 
regions  of  the  two  tests.  The  rejection  regions  are  given  for  each  level  of  ri3. 

The  symbols  on  the  graphs  have  the  following  significance: 

Dots  (•)  mark  the  points  not  in  the  rejection  region  of  either  test. 

Asterisks  (*)  mark  points  that  are  common  to  both  rejection  regions. 

O-dots  (©)  mark  points  that  are  in  the  rejection  region  of  the  GLRT  but 
not  in  the  rejection  region  of  the  union-intersection  test. 

X’s  mark  points  that  are  in  the  rejection  region  of  the  union-intersection 
test  but  not  in  the  rejection  region  of  the  GLRT. 
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Figures  4  (a)  -  (g)  Powers  of  the  GLRT  and  the  Union-Intersection  Test 
(.25,  .75,  .95,  0,  1,  6;  10)  Compared 

Note:  Comparisons  of  the  powers  of  the  two  tests  are  made  at  selected  points 
along  rays  leaving  the  point  po  =  (.25,  .5,  .2)  toward  fig.  Each  ray  takes  the 
form  p  =  po  —  7  d.  Here  7  >  0  is  the  abscissa.  The  power  is  graphed  as  the 
ordinate.  The  direction  vector,  d,  is  an  element  of  the  subset  of  i?3  that  consists 
of  the  23  —  1  =  7  nontrivial  vectors  of  the  form  {di,d^td3)  where  d,-  =  0  or  1, 
i  =  1,2,  and  3.  Points  in  fl0  were  taken  along  these  seven  rays  for  values  of 
7  =  .025,  .05,  .075,  .1,  etc.  until  the  smallest  of  the  three  coordinates  of  a 
selected  point  was  less  than  or  equal  to  .05.  Points  with  at  least  one  coordinate 
as  small  as  .05  are  near  the  boundary  of  Q  and  into  areas  of  extreme  alternatives 
of  little  interest. 

The  solid  line  is  the  power  of  the  GLRT. 

The  dotted  line  is  the  power  of  the  union-intersection  test. 
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(d)  d  =  (1, 1,0).  Points  are  of  the  form  (.25  -  7,  .5  -  7,  .2) 


1*0 


0 


-I* 

0.023  0.075  0.123  0.173  0.223  0.275  0.225  0.375  0.425 

0  0.05  0.1  0.15  0.2  0.25  0.3  0.25  0.4 


(e)  d—  (1,0,  l).  Points  aje  of  the  form  (.25  -7,  -5,  .2-7). 


(f)  d=  (0,1,1).  Points  are  of  the  form  (.25,  .5  -  7,  .2  -  7). 


21 
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We  see  from  Figures  4  (a)  -  (g)  that  the  power  of  the  GLRT  is  larger  at  every  point 
considered  and  substantially  larger  at  most  points. 

The  test  (.25,  .75,  .95,  0,  1,  6;  10)  is  not  the  “optimal”  union-  intersection  test  whose 
power  is  approximately  that  of  the  GLRT.  That  test  rejects  if  nj  +  n2  <  3,  or  +  n2-rn3  < 
7.  It  does  not  consider  values  of  ni  as  part  of  its  rejection  region.  From  Table  2,  we  see 
that  the  upper  bound  on  its  size  is  .0150.  An  investigator  would  not  choose  this  test  if  he 
or  she  were  constructing  a  union-intersection  test  with  .05  as  an  approximate  size. 

The  selection  of  the  particular  set  of  d*s  to  be  used  in  Figure  4  was  not  arbitrary. 
The  vectors  used  in  (a),(b),  and  (c)  can  be  interpreted  as  alternatives  that  describe  the 
‘breakdown’  of  the  null  hypothesis  in  only  one  quantile.  For  example,  (a)  are  points  in 
that  fail  to  be  in  flo  because  of  their  first  coordinates;  the  “interquantile  range”  is  adequate 
for  the  second  and  third  quantiles.  These  distributions  fall  short  only  on  the  first  specified 
quantile.  Likewise  (c),  (d),  and  (e)  are  points  that  fail  to  be  members  of  fl0  because  of  two 
coordinates.  This  is  a  more  pathological  situation  than  the  situation  in  (a),  (b),  or  (c). 
Figure  4  (g),  in  the  sense  of  ordering  described  so  far,  represents  the  most  extreme  case  of 
the  failure  of  the  null  hypothesis  to  be  true.  In  this  case,  the  distribution  comes  up  short 
on  all  three  quantiles.  The  graphs  indicate  that,  for  the  same  -y,  both  the  GLRT  and  the 
union-  intersection  test  have  higher  power  on  (g)  than  on  (d),  (e),  or  (f)  and  higher  power 
on  these  three  than  on  (a),  (b),  and  (c).  In  this  sense  the  power  of  both  tests  is  sensitive 
to  the  number  of  parameters  that  fail  to  meet  the  requirements  of  the  null  hypothesis. 

All  of  the  graphs  in  Figure  4  except  (b)  appear  incomplete.  This  is  because  rays  from 
po  can  extend  for  longer  distances  and  remain  in  fl  for  some  d.  than  for  others. 

For  testing 


H0  :  Pi  >  .25,  Pi  +  P2  >  -75,  and  Pi  +  P2  +  P3  >  -95, 

Figures  4  (a)-(g)  illustrated  an  advantage  of  the  GLRT  over  the  union-intersection  test 
(.25,  .75,  .95,  0,  1,  6;  10)  with  respect  to  power.  Not  only  is  the  power  larger  in  every 
case,  but  the  Type  I  error  is  closer  to  the  specified  value. 

The  example  that  follows  illustrates  a  case  where  the  GLRT  serves  only  to  “fine 
tune”  the  Type  I  error  rate  of  the  union-intersection  test  rather  than  to  improve  its  power 
appreciably.  Consider  testing 

H0  :  pi  >  .3,  pi  +  pi  >  .6,  and  Pi  +  P2  +  P3  >  -9. 

The  GLRT  rejects  if  -2log(A(N))  <  4.8  with  a  size  a  =  .0503.  We  display  Table  3  in 
order  to  illustrate  the  selection  process  for 

(.3,  .6,  .9,  n°,  n°,  10). 
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Table  3:  Abbreviated  Binomial  Table  for  N  =  10. 


n0\P 

.30 

.60 

.90 

t 

0282 

.0001 

.0000 

1 

.i493 

.0017 

.0000 

2 

.3828 

.0123 

.0000 

3 

.6496 

.0548 

.0000 

4 

.8497 

.1662 

.0001 

5 

.9527 

.3669 

.0016 

6 

.9894 

.6177 

.0128 

7 

.9984 

.8327 

.0702 

8 

.9999 

.9536 

.2639 

We  select  the  union-intersection  test  (.3,  .6,  .9,  0,  2,  6;  10).  The  upper  bound  on  its 
size  is  .053.  Its  true  size  is  .048.  This  is  the  union-intersection  test  described  earlier  whose 
rejection  region  is  contained  in  the  rejection  region  of  the  GLRT  and  has  the  highest  power 
of  any  union-  intersection  test  with  Z  C  Z\. 

The  rejection  regions  of  these  two  tests  are  identical  except  that  the  GLRT  rejects  Ho 
if  (ni,n2,n3)  =  (1,2,4),  while  the  union-intersection  rejection  test  does  not.  The  powers 
of  these  two  tests  will  obviously  be  similar  except  for  alternatives  close  to  this  point  and 
even  for  such  alternatives,  the  powers  only  differ  by  approximately  .00025,  the  mass  of 
that  point  when  (pi,P2,P3)  =  (.1,  .2,  .4).  The  marginal  improvement  that  is  the  result  of 
using  the  GLRT  is  to  increase  the  Type  I  error  rate  to  almost  exactly  .05. 

Summary  and  Conclusions 

The  purpose  of  this  work  reported  here  is  to  construct  a  small  sample  test  procedure 
to  test  the  null  hypothesis 

t  i 

Ho  :  P"  -  £  P°  •/>  1  ^  ^  K 

»/=i  »/=i 

which  is  superior  to  any  union-intersection  test  based  on  1^1= i  A’V,  1  <  * <  K,  with  critical 
values  chosen  from  the  binomial  tables.  There  is  little  computation  needed  to  apply  the 
union-intersection  test.  One  simply  observes  ni,  ni  +  n2,  •••,  nv.  If  any  one  of  these 

is  smaller  than  a  corresponding  predetermined  critical  value,  the  null  hypothesis  is  rejected. 
The  Type  I  error  rate  of  this  test  can  only  be  estimated  from  above  by  the  binomial  tables. 
For  K  >  3  there  are  usually  several  different  combinations  of  critical  values  from  which  to 
choose.  The  tables  give  no  criteria  upon  which  to  base  a  decision  as  to  which  combination 
is  the  best  choice.  Of  course  one  can  compute  the  multinomial  probabilities  upon  which 
the  Type  I  error  rate  of  a  test  based  on  a  particular  set  of  critical  values  depends,  but 
there  is  no  guarantee  that  any  union-intersection  test  can  be  constructed  yielding  an  a 
close  to  some  predetermined  level.  Even  if  one  is  found,  it  may  be  such  that  for 

some  i  <  K  may  not  be  considered  in  the  decision  process.  This  is  unsettling  if  we  keep 
in  mind  that  this  is  supposed  to  be  a  simultaneous  test  on  all  of  the  quantiles. 
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Juxtaposed  to  the  simplicity  and  uncertainty  of  the  union-intersection  test  is  the  com¬ 
plexity  and  certainty  of  the  GLRT.  An  algorithm  is  provided  that  facilitates  the  evaluation 
of  the  statistic.  Although  computation  of  the  p-value  or  critical  value  is  prohibitively  com¬ 
plicated  if  done  by  hand,  on  a  AT-Type  PC,  computations  of  p-values  for  K  =  7  and 
N  =  20  take  less  than  5  minutes.  It  has  been  illustrated  that  the  relative  superiority  of 
the  power  of  the  GLRT  over  the  union-intersection  test  depends  on  the  choice  of  the  set  of 
critical  values  of  the  union-intersection  test  and  the  set  of  null  hypothesis  parameters.  It 
appears  that  the  most  apparent  choice  of  critical  values  for  the  union-intersection  test  can 
yield  a  test  with  far  less  power  than  the  GLRT  even  though  there  is  a  union-intersection 
test  whose  power  is  less  than  but  approximately  the  same  as  power  of  the  GLRT.  That 
test  will  more  than  likely  not  be  chosen  by  the  researcher  because  the  upper  bound  on  its 
Type  I  error  rate  is  below  the  nominal  rate  and/or  it  requires  excluding  one  or  more  of 
the  quantiles  from  consideration.  It  is  because  of  the  uncertainty  of  the  union-  intersection 
test  that,  in  spite  the  complexity  of  computation,  the  GLRT  can  be  a  useful  alternative. 
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Appendix  A 


Proofs  of  Results  Supporting  the  Evaluation  Algorithm 

We  restate  Theorem  1. 

THEOREM  1.  Define  i o  to  be  0  : 


*(u  |•••|*^ ) 
Pi 


W 


Pi  Po  l  , 

PL 


if  ij-i  <  i  <  t; ,  1  <j<t,  and  jL  0, 


_  <  (b)  Poit  if  t'y-1  <  *  <  *y»  1  <  y  < 

(c)  P-^  if  *«  <  »  <  if  and  P^  ±  1, 

1  1  -  P“ 

[  (dj  0,  if  it  <i<K  and  P‘‘  =  1. 


PROOF: 


The  case  when  t’t  +  1  <  K  : 

If  * y _ i  +  1  =  ty,  write  the  ijth  restriction,  Px>  =  P0  3 ,  as  Pb- 1  +  ptj.  =  P0  ;_1  -f  Poi, 
Since  restriction  »y_i  is  PV-»  =  Po1-1 ,  P/  =  Pi.  and  Po  /  =  Poi.  •  Thus 

J  j—i  *  j- i  J 

.... ) 

Pt- .  —  Po  t,-  —  — — 

1  ip') 


V-1 


and  the  given  formula  part  (a)  holds  trivially. 

Assume  that  for  the  jth  restriction  1  <  j  <  t,  ij- i  +  1  <  »y.  As  was  noted,  the  t 

restrictions  placed  on  the  coordinates  of  p  that  define  Plane^’  can  be  written,  for 

+1 

Rewriting  logL(p,  p)  using  the  t  restrictions  in  this  form  we  obtain 


t  r  •>  - 1 

E 

.*/=«,•_  i  +  i 


y=i 


p„  log (p„)  +  p.-,  iog(p0 j  -  Pi..,1) 

:  i  i  '  ' 

(* 


* 


+  X^P^Jog(p^)  + 

^=«i  +  l 


log  1  -  P0 


*0' 
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Assume  also  that  P'}  ^  0  as  in  formula  part  (a). 

1 

We  are  seeking  an  expression  for  p[)1'  for  »y-i  + 1  <  u  <  ty.  The  only  part  of  logL (p,p) 
involving  p for  such  u,  is 


J 

E  p*  ioe  w + Pi,-  iog  (p0  y 


(Al) 


(The  parameter  p,-.  is  represented  in  (Al)  by  PQV  —  PJ  1  since  the  jth  restriction 
v  3  'j-i  V-j 

requires  that  PQV  =  PA  .) 

y  - 1  y — i 

Taking  partial  derivatives  with  respect  to  pu  in  (Al)  and  setting  them  equal  to  zero 
yields  the  following  system  of  equations: 


P* 

P* 


Ply 


P  A  _  D  3 

Oi-  ,  ■r»  , 

j-l  3-1 


i . — i  >  ^  ~  */ — i  "b  i  >  ■  *  ■  >  *V 


l. 


It  follows  that 


»"'»'« )  —  _ j  L 

r  1/ 


,  u  =  *  y — i  +  1,  • 


>  */ ■ 


In  the  special  case  that  Pt-J  ^  =  0,  as  in  formula  part  (b),  expression  (Al)  is  identi¬ 
cally  0,  i.e.,  the  restricted  likelihood  function  is  not  a  function  of  pv  for  »y_x  +  1  <  v  <  :‘y 
whose  values  have  no  effect  in  maximizing  the  likelihood  function  on  Planel'1  ’  The 


restriction  on  p 


(*'j, 


for 


t'y-1  +  1  <  v  <  ij  is  that  E!J_1  +  i  pL”’ "  ,,,)  =  Poy  t .  Thus,  for 


P  0 


consistency  we  may  define  pi*1’"  ’**^  =  p0„  for  such  u. 


Now  let  it  +  1  <  v  and  assume  P‘*  ^  1  as  in  formula  part  (c). 
The  portion  of  logL(p,p)  involving  such  pv  is 

K 

E  P*'1°g(P«')  +  (l  -  PK)  I°g(l  -  Po"  -  px\)- 

t/=l't  +  l 


Again  taking  partial  derivatives  with  respect  to  pv  for  it  +  1  <  v  <  K  and  setting  them 
equal  to  zero  yields  the  system  of  equations 


It  follows  that 


pv  _  l~pK 
Pu  1  -  P0  “  -  P*  ' 


«(*’»  .•.*«) 
r  v 


Pu{  1  -  Pp) 
1  -  P“ 


28 


In  the  case  where  Pu  =  1,  as  in  formula  part  (d),  p„  =  0  for  t*  +  1  <  u  <  K. 
Since  there  are  no  restrictions  on  these  parameters,  we  use  the  global  MLE’s  and  define 
pi'*’"  '** )  =  pv  =  0  for  such  v. 


The  case  when  it  =  K  : 

In  this  case  the  expression  for  logL(p,p),  employing  the  t  restrictions,  simplifies  to 


E 


■  ,j  -  * 

p»  iosM + (po  y._ , 


(1-PK)  \og(l-P0K). 


The  case  it  +  1  <  v,  the  conditions  for  tha  application  of  formula  parts  (c)  and  (d),  do  not 
occur.  Thus  the  previous  argument  for  t*_i  +  1  <  u  <  it  =  K  holds  here.  <> 

The  following  theorem  validates  the  Evaluation  Algorithm: 


THEOREM  2.  If  p  €  ftg  then  po  =  p~  *’  ’  \  where  {*i,  ••*,**}  is  the  smallest  subset  of 

{1,2, •••, K}  such  that  p'  ^  €  Ho- 

This  follows  from  the  following  sequence  of  lemmas. 

1  )  ^ 

LEMMA  1A.  Ifp  €  fl0  and  p  ^  Ho,  then 

P‘*  <  P0  . 


PROOF:  We  are  given  that  p €  ft0  or,  pMl  >  P0l,  V/.  But  the  assumption  that 
p  £  fl0  implies  that  for  at  least  one  /,  Pl  <  P0l  : 

(1)  If  /  =  t‘i  and  P'1  <  P0'\  we  have  the  result. 

(2)  If  /  <  i'i  —  1  and  Pl  <  P0 1,  for  such  l  from  Theorem  1  formula  parts  (a)  and  (b) 

p('^1  >  PQl  <=>  P'P0'1  >  P0  tp'1 . 

The  assumption  that  P1  <  P0 1  implies  that 

PlP0  ’*  >  PlP ’* . 


The  assumptions  that  >  P0 

to  conclude  that 


and  that  P*1  ^  0  assure  that  P*  ^  0.  This  permits  us 

Pil  <  P0il. 
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(3)  If  t’i  <  /,  then  from  Theorem  1  formula  parts  (c)  and  (d), 


P(‘ »)'  >  P0l  <=>  Plx  (1  -  P0 “ )  >  Pq,^  (l  ~  P‘‘)- 

Subtract  the  term  Pi  •  P-  from  both  sides  of  the  above  expression  to  obtain 

l 

>  pQi  «  P-^l-Po')  ^Po!^  (l-P*)- 

Recall  that  P1  <  P0l  or  l  —  P0l  <  1  —  Pl .  Thus 

(1  -  Po‘)  >  Pci,  (1  -  P')  >  Pol,  (1  -  Pc1)- 

Considering  only  the  extreme  terms  and  dividing  both  sides  by  1  —  P0 1  we  obtain  Pj  > 
P0 !  .  But  if  P‘  <  P0 then  P‘»  <  P0  *  * .  0 

If  we  denote  p  as  p^  °\  the  superscript  t‘o  denoting  that  it  is  the  ‘0  restriction  MLE’, 
then  we  may  restate  Lemma  1A  as: 


If  ~p  °’  *)  €  Ho  and  p~  $  fl0,  then 


P(*'»)‘‘  <  P0’‘ 


In  this  context,  the  following  lemma  is  the  analog  of  Lemma  1A  when  we  treat  a  ‘t  restric- 

■  •  i  r  ▼  1  -f  *  *  *  1 

tion  MLE  ,  p 

LEMMA  2  a.  Let  t  >  2  and  assume  that  p~  *’  ’  €  Ho- 

(1)  (t  n0,  then  p(»»  <  P0’*. 

(2)  Ifp'3,'3  ''")  (£  n0,  then  p(‘»  <  po*'» 

(3)  If  for  any  jt  2  <  j  <  t  -  l,  p‘%1'  +  £  n0,  then 

p(»'i  •  »V-i i*'»)*V  <  pQ v 


The  proof  of  this  lemma  is  similar  to  the  proof  of  Lemma  1A  and  will  be  omitted. 

LEMMA  3  a.  If  the  number  of  restrictions  t  >  1  and  p'  ' ’  ’  6  flo  but  none  of  the 

edges  characterized  by  any  of  the  t  subsets  of  order  t-1  of  the  set  {»!,•••,  »<}  contain  their 
restricted  MLE’s,  then 


p'<‘  , (i  -  V')  <  Poll  ,(i -A'),  i  < j  <  i. 


v-j 


V-1 


(.42) 


PROOF:  Note  that  the  assumptions  of  this  lemma  are  the  same  as  the  assumptions  of  Lem¬ 
mas  1A  and  2A.  We  can  apply  the  results  of  those  lemmas  to  state  that  P('3 <  P0‘l , 
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must  be 


p(» ^  p  it 


<  pQ*\  ...  p(«i.3-‘, -a-:.).',-.  <  p0*'-«>  an(j  <  pQ‘.  an 

true. 

(1)  Start  an  induction  argument  with  j  =  t. 

Because  P(“‘a •■■**-» )*«—i  =  pQ  ‘‘-1 f 

p(i,.3 •••»«_,)»'«  <  po». 

<=►  A-'_,  (i  -  Po‘-‘)  <  Pot,  (x  - 

from  Theorem  1  formula  part  (c).  Subtracting  Pj*  Pn‘‘  from  both  sides  of  the  last 

•  t-I 

expression,  we  have 

PtA1-poi‘)<Pot,(1-p<')- 

This  theorem  holds  for  j  =  t. 

(2)  Now  we  argue  by  "reverse”  induction;  we  assume  that  the  result  holds  for  j  +  1  and 
show  that  it  holds  for  j. 


If  it  is  true  for  j  +  1  we  have 

pV+1  (l  -  P0‘*‘)  <  P0  V+I  (l  -  P‘‘). 

Recall  that 


p(»i  —  V-»*j+i  •■■it)ij  <  */ 


p(‘V->‘V+»)‘V  <  p  y 


j-i 


p'V  p  V+i  v  p  *>  p’V+» 
V-1  0 V-» 


Subtracting  Pt-J  P0  V  from  both  sides  we  write 


j-i  j-i 


M3) 


p(»v-i»v)v  <  p0v_j  <=►  p >  pn/+i  <  pny  pv+\ 


Now  multiply  (A3)  by  P0  V  and  apply  this  last  expression  of  p(*V-»«V)«y  to  obtain 


j-i 


P!;_,P°‘‘"  i.1  -  Po‘')  <  Po  t  Pot  <  Po  t ,  Po  t' '  (x  -  >'■ )  • 

The  extremes  of  the  above  inequality  with  both  sides  divided  by  P0  V+I  yield 

y 
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0 


This  is  the  desired  result  for  j.  By  induction  the  result  holds  for  1  <  j  <  t. 

We  now  have  established  the  prerequisites  to  prove  Theorem  2,  the  theorem  that 
supports  the  evaluation  algorithm. 

A  (  Jj  J 

PROOF  OF  Theorem  2:  (1)  Assume  first  that  t  =  1,  p  €  ns,  and  p  e  n0.  We  will 
prove  that 

«  -  -jv, ) 

max  logL(p,  p)  =  logL(p,  p  ) . 

p€Oo 

Written  with  respect  to  terms  involve  P0  ** 

logL(^^M))  =  P“  log(P0‘‘)  +  (1  -  P’>)  log(l  -  P0 ’*)+<?  =  h(P0< ■)  +  C. 

*  ii‘i) 

Here  C  represents  all  those  terms  in  logL(p,p  )  that  do  not  depend  on  P0*‘ .  Taking  the 
derivative  of  h(.)  with  respect  to  P0'1  leads  to  the  conclusion  that  the  maximum  value  of 
the  likelihood  function  restricted  to  the  plane  satisfying  the  restriction  P*‘  =  p  decreases 
with  increasing  values  of  p  on  the  interval  p  >  P*‘.  Now  consider  any  point  p*  €  Ho-  By 
the  definition  of  Ho,  the  first  tx  coordinates  of  p"  must  satisfy  P0l1  <  Pv  =  P"'- 

Thus  p*  lies  on  a  plane  parallel  to  Plane^  but  with  elements  satisfying  the  restriction 
P‘ 1  =  P**‘.  Because  of  this  the  maximum  value  of  the  likelihood  function  restricted  to 
the  plane  satisfying  the  restriction  P“  =  P**1  is  not  larger  than  logL(p,^’1  ^).  But  p"  is 
an  element  of  that  parallel  plane  and  thus  logL(p,p*)  <  logL(p,^  ^).  We  conclude  that 

~p  1  ^  maximizes  logL(p,p)  as  a  function  of  p  on  Do.  The  theorem  is  proved  for  this  case  if 
P‘l  <  P0'1  but  this  follows  from  Lemma  1A. 

Now  consider  the  case  when  p  '  ’  ’  ^  €  fio,  t  >  2,  and  none  of  the  MLE’s  restricted 
to  “planes”  characterized  by  the  t  subsets  of  {»i, •••,»<}  with  t  —  1  elements  are  in  D0. 

We  will  show  that  the  log-likelihood  function,  logL(p,p),  is  maximized  on  ft0  by  p’'1'  . 

Proceeding  in  a  similar  fashion  as  the  case  above,  we  write 

logL(p,p('l*"■’,,))  = 
t 

£  A- +  (1  _  **') 106(1  -  vo  + c  ■ 

y=i 


The  last  term,  C,  is  a  constant  with  respect  to  P0t’J  for  1  <  j  <  t.  For  any  p  €  fl,  denote 


j-i 


Pj  =  P,J  for  1  <  j  <  t.  Consider  the  above  expression  as  a  function  of  the  p,  ; 

3-  »  J 

f{pi,p2,---,pt)  =  E(A\ t )  log(py)  +  (i-P“)log(l-EP;)- 

i=i  3  '  i=i  / 
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Letting  pj  =  P0^_x , 


iogi(p,p£i,“‘,t*))  =  f[p°,p°,-  +  c 


The  partial  derivatives  of  /(•)  with  respect  to  pj,  1  <  j  <  t,  yield  the  following 
conditions: 


df_ 

dpj 


<  0 


p‘j  « . 

1-P“ 


v-» 

PS 


Pi/ 


j/=1 


^..f1  f®r 

'  l/=l  ' 


1  <j<t. 


(A4) 


Note  that  if  pj  =  p°,  then  (A4)  is  exactly  (A2)  and  (A2)  is  true  under  the  given  conditions. 
Define 

Dpo  =  {p  €  |  Pj  >  pj,  vy,  1  <  J  <  <}. 

Inequality  (A4)  is  true  for  any  p  G  Dp0.  Choose  any  p*  €  fl0  and  let 


/>;=  E  »c. 

Note  that  p*  e  Dp0.  From  (A4) 

logL(p,pi“’"',*))  =  /(p?,P2»-‘*.P?)  +  C  > 

f[P*liP2i’"iP<t)  +  C  >  /(pJ,P2,*-’  ,P°)  +  C  >•••> 
/(Pi,P2>-'-,P?)  +  c  =logL(p,p'). 

Since  p*  waa  an  arbitrary  element  in  Do,  we  have  proven  the  theorem. 


<> 


33 


Appendix  B 


Proofs  of  Results  Supporting  the  Approximation  of  the  Rejection  Region  of  the  GLRT  with 
Rejection  Regions  of  Union- Intersection  Tests 

Let  p  €  RK,  0  <  pi,  and  £*=1  p»  <  1. 


DEFINITION. 

(l)  Denote  by  T(p ,)  the  univariate  binomial  test  with  acceptance  region 


jp  |  P%  >  P.J- 


(2)  The  union-intersection  test  whose  acceptance  region  is 

P||p|p‘ >p,|  is  then  denoted  p|  T(p,). 

»'=n  '  '  »= i 

It  is  well  known  that  for  each  i,  T(p,)  is  the  uniformly  most  powerful  unbiased  test  of  its 
size  for  testing  the  hypothesis 

Hq  :  P*  >  PQ  *. 

Arguments  that  follow  will  show  that  the  rejection  region  of  the  GLRT  can  always 
be  bounded  above  and  below  in  the  sense  of  set  inclusion  by  the  rejection  regions  of  two 
union-intersection  tests  of  the  type  defined  in  the  definition  (2)  above.  This  result  has 
consequences  both  for  the  determination  of  the  acceptance  region  of  the  likelihood  ratio 
test  discussed  here  and  for  power  comparisons  with  other  tests. 

Lemma  IB  is  used  to  prove  Lemma  2B.  Each  are  proven  are  proven  similarly  to 
Lemma  2A  of  Appendix  A  and  thus  are  only  stated  here.  Lemma  2B  and  3B  are  used 
to  prove  Lemma  4B,  the  central  result  used  in  the  identification  of  the  lower-bounding 
Union-Intersection  Test. 

LEMMA  IB.  Under  the  conditions  of  Lemma  1A  or  2  A  (Appendix  A),  we  have 

p(v-.)vt<  pQ‘V  j  =  2, •••,*. 

Lemma  2B. 

Under  the  conditions  of  Lemma.  1A  or  2A  (Appendix  A),  Px>  <  PQ1’ ,  for  l<j<t. 


This  next  lemma  describes  the  behavior  of  the  univariate  likelihood  functions  of  the 

EUi*v>  i  <«  <k. 


* 


LEMMA  3B.  The  function 


ga(x)  =  x log  (  -  j  +  (1  -  z)  log 


where  0  <  a  and  0  <  x  <  1, 


increases  for  all  x  <  a. 


Using  the  expression  given  in  Theorem  1  for  the  coordinates  of  p  1  ,  consider 


logL^’0)  -logL(£#  =  X>log(^J  +  PJog^'f-S°,» 


1  -  P n 


/  -I  _  p  »» 

(l  -  PK)  log  i— 
v  'll  -P" 


pil  log[w)  +  (1 "  P<1)  log(l^)  =  V1  {pil)' 

Lemma  3B  says  that  g  .  (P'1)  is  an  increasing  function  of  P'1  when  P“  <  P0'' .  We 

Fo 

keep  this  in  mind  in  the  proof  of  the  next  lemma. 

LEMMA  4  B.  If  A (N)  is  the  test  statistic  for  the  GLRT  discussed  here  whose  acceptance 
region  contains  n0,  then  its  rejection  region  contains  the  rejection  region  of  a  Union- 
Intersection  Test. 


PROOF:  Let 


Zf  =  {log(A(W))  <  log(£)}, 


the  rejection  region  for  the  GLRT  for  some  error  rate  a.  Assume  that  6  and  a  are  such 
that  Ho  is  contained  in  £|,  the  acceptance  region.  Assume  that  p  6  Qq-  As  defined  in  the 
main  body  of  this  paper,  p0  satisfies 

sup  logL {p,p)  -  logL(p,p0) 

p€Oo 


and  define 


Regional, •  ••,»'()  =  {p  S  Go  I  Po  =  ’  ^ 


By  definition,  p  €  Region(i i ,  -•*,**)  =>•  p  €  Hq.  Since 

Planed1’  C  Plane^'0, 


we  have 


logLtp,^*1  ’  ’*'))  <  logL(p,^t,)). 
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For  such  p,  the  denominator  of  A (N)  is  L(p,p).  Subtracting  L(p,p)  from  both  sides  of  the 
last  inequality  yields 

log  (A  (A1))  =  N  (logLfp,  'p1'  '"*)  -  logL(p,£))  <  N  (\ogL(p,p''))  -  \ogl('p,'p))  = 

"Sr  ,.{?■)■  US) 

0 

From  Lemma  2B,  for  such  p,  P‘*  <  P0‘*  and  from  Lemma  3B  and  the  discussion  that 
followed,  there  is  a  p^ ,  defined  by 


such  that 


Pi,  =  max  . 
o  <p<p0“ 


m 


iog(6) 

N 


<=>  P"<Pl • 


From  (A5),  it  follows  that 

pe  Region (t i, •  ••,:<)  and  Pu  <  p,*t  =>•  log(A(JV))  <  log(<5). 


(A7) 


We  conclude  that 


Region (»i ,  •  •  • ,  it)  n  {p  |  P‘‘  <  p*it }  C 


In  the  case  that  p?  =  0, 


Region(iU"  •  ,»*)  n  |p  |p’‘  <p-}  =  0C  Z6. 

By  complementation  and  the  definition  of  p*ix ,  it  follows  that  if 

p  €  Regional, •••,»*)  n  ££, 

then 

1ob(a(ao)  >  log(<)  *•  (?••)  >  <=►  P"  >  pi. 

If  there  exists  a  subset  of  {l,  2,  •  •  • ,  K }  containing  it  such  that 

Region (t j, •  •  • , it)  n  Zc6  ^  0,  for  such  p  we  have  the  inequality  p?t  <  Pl*  <  P0‘‘ 
and  conclude  that  the  acceptance  region  of  T(p*it)  contains  the  set  {p  |  P‘*  >  P0’*}; 

{p|p'->p0-}c{p|p'>p;t}. 

Define  $,  =  collection  of  all  subsets  of  the  set  {1,2,  •  •  •  ,K}  that  contain  t.  Note  that 
OS  =  Region(l)  U  Region(2)  u  Region(l,2)  U  Region(3)  U 
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Region(l,3)  U  Region(2, 3)  U  •••  U  Region(l, 2, •  •  • , K). 

Equivalently,  we  wiite 

K 

ns  =  U  U  Reg‘on(5)- 

t=i  ses{ 

If  for  each  t,  1  <  t  <  K,  Region (5)  ^  0  for  some  S  G  S,,  we  have 

IC  K 

Ui  5  f){*  I p<  Z  ”<)  2  fl^l  p<  2  po‘}  =  IV  [At) 

«=1  V=1 

Note  that  Ucs  is  the  acceptance  region  of  a  union-intersection  test  characterized  by  the 
collection  of  tests 

{r(p-),  i  <■'<*•}• 

It  follows  from  (A8)  that 

K 

Us  c  =  (J  (J  Region(S). 

»  =  1  S<ZS; 


If  p  G  Us  then  p  G  S  for  5  G  Si,  for  some  t,  1  <  i  <  K.  From  (A8),  we  deduce  that 

P'  <  log(A(7V))  <  log(^)  =>•  p  G  Rs- 

We  conclude  that  the  rejection  region  of  this  union-intersection  test  is  contained  in  the 
rejection  region  of  the  GLRT  or  the  acceptance  region  of  the  GLRT  is  contained  in  the 
acceptance  region  of  this  union-intersection  test. 

It  has  been  assumed  that  for  each  »,  1  <  :  <  Ky  at  least  one  of  the  sets  in  5,-  is 
nonempty.  This  is  needed  to  show  (A8).  If  there  exists  an  i'  such  that  S{  is  a  collection  of 
empty  sets,  then  for  no  p  G  Hq  is  the  likelihood  function  maximized  on  Edge(»)  or  on  any 
of  Edge(i)’s  intersections  with  other  edges.  The  assumption  ensures  that  the  acceptance 
region  of  the  GLRT  contains  Oo-  For  such  an  *,  we  let  p*  =  P0’,.  then  (A8)  continues  to 
be  true,  ensuring  the  above-stated  relationship  between  Do  and  U |.  0 

The  test  that  accepts  Ho  iff  p  6  Ho  is  also  a  union-intersection  test.  In  this  case 
Pi  =  P0’,  1  <  *  <  K.  We  have  have  proven  the  following  theorem: 

THEOREM  3.  If  X (N)  is  the  test  statistic  for  the  GLRT,  the  acceptance  region  of  which 
contains  flo»  then  its  rejection  region,  Rs,  is  bounded  respectively  above  and  below  in 
terms  of  set  inclusion  by  the  rejection  regions  of  Union -  Intersection  Tests 

(1  t(k)  I Vm- 

i/=i  v=i 

0 
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